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Depth first search (DFS) tree is a fundamental data structure for solving various problems in graphs. It 
is well known that it takes 0{m + n) time to build a DFS tree for a given undirected graph G = {V, E) on 
n vertices and m edges. We address the problem of maintaining a DFS tree when the graph is undergoing 
updates (insertion and deletion of vertices or edges). We present the following results for this problem. 

1. Fault tolerant DFS tree: 

There exists a data structure of size 0(m) [^such that given any set E of failed vertices or edges, a 
DFS tree of the graph G\E can be reported in 0{n\F\) time. 

2. Fully dynamic DFS tree: 

There exists a fully dynamic algorithm for maintaining a DFS tree that takes worst case 0{^mn) 
time per update for any arbitrary online sequence of updates. 

3. Incremental DFS tree: 

Given any arbitrary online sequence of edge insertions, we can maintain a DFS tree in 0{n) worst 
case time per edge insertion. 

These are the first o(m) worst case time results for maintaining a DFS tree in a dynamic environment. 
Moreover, our fully dynamic algorithm provides, in a seamless manner, the first deterministic algorithm with 
0(1) query time and o{m) worst case update time for the dynamic subgraph connectivity, biconnectivity, 
and 2-edge connectivity. 
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1 Introduction 


Depth First Search (DFS) is a well known graph traversal technique. Right from the seminal work of Tarjan 
1^ . DFS traversal has played the central role in the design of efficient algorithms for many fundamental 
graph problems, namely, biconnected components, strongly connected components, topological sorting |[39l . 
bipartite matching |[28l . dominators in directed graph BOl and planarity testing ||29]| . 

Let G = (L, i?) be an undirected connected graph on n = \V\ vertices and m = \E\ edges. DFS 
traversal of G starting from any vertex r £ V produces a rooted spanning tree, called a DFS tree with r 
as its root. It takes 0(m + n) time to perform a DFS traversal and generate a DFS tree. Given any rooted 
spanning tree of graph G, all non-tree edges of the graph can be classified into two categories, namely, 
back edges and cross edges as follows. A non-tree edge is called a back edge if one of its endpoints is an 
ancestor of the other in the tree. Otherwise, it is called a cross edge. A necessary and sufficient condition 
for any rooted spanning tree to be a DFS tree is that every non-tree edge is a back edge. Thus, it can be seen 
that many DFS trees are possible for any given graph. However, if the traversal of the graph is performed 
according to the order specified by the adjacency lists of the graph, the resulting DFS tree will be unique. 
The ordered DFS tree problem is to compute the order in which the vertices get visited when the traversal is 
performed strictly according to the adjacency lists. 

Most of the graph applications in real world deal with graphs that keep changing with time. These 
changes/updates can be in the form of insertion or deletion of vertices or edges. An algorithmic graph 
problem is modeled in a dynamic environment as follows. There is an online sequence of updates on the 
graph, and the objective is to update the solution of the problem efficiently after each update. In particular, 
the time taken to update the solution has to be much smaller than that of the best static algorithm for 
the problem. In the last two decades, many elegant dynamic algorithms have been designed for various 
graph problems such as connectivity iflTl 12615711^ . reachability Il35ll37ll . shortest path |[T4ll36l . spanners 
||71|22l|34l, and min-cut Il42ll . Another, and more restricted, variant of a dynamic environment is the fault 
tolerant environment. Here the aim is to build a compact data structure, for a given problem, that is resilient 
to failures of vertices/edges and can efficiently report the solution for a given set of failures. There has been 
a lot of work in the last two decades on fault tolerant algorithms for connectivity llT0l[T^l20ll . shortest paths 
l|6lll2l|T5l, and graph spanners ifSlITTIl. 

A dynamic graph algorithm is said to be fully dynamic if it handles both insertion as well as deletion 
updates. A partially dynamic algorithm is said to be incremental or decremental if it handles only insertion 
or only deletion updates respectively. In this paper, we address the problem of maintaining a DFS tree 
efficiently in any dynamic environment. 

1.1 Existing results on dynamic DFS 

In spite of the simplicity of a DFS tree, designing any efficient parallel or dynamic algorithm for a DFS 
tree has turned out to be quite challenging. Reif |[32ll showed that the ordered DFS tree problem is a P- 
Complete problem. For many years, this result seemed to imply that the general DFS tree problem, that 
is, the computation of any DFS tree is also inherently sequential. However, Aggarwal and Anderson Q 
proved that the general DFS tree problem is in RNC by designing a parallel randomized algorithm that takes 
0(log^ n) time. Further, the fastest parallel deterministic algorithm for general DFS tree takes 0{y/n) time 
1311211. Whether the general DFS tree problem is in NC is still a long standing open problem. 

Reif ||33]| and later Miltersen et al. OTl proved that P-Completeness of a problem also implies hardness 
of the problem in the dynamic setting. The work of Miltersen et al. lOTll shows that if the ordered DFS 
tree is updateable in O(polylog(n)) time, then the solution of every problem in class P is updateable in 
O(polylog(n)) time. In other words, maintaining the ordered DFS tree is indeed the hardest among all 
the problems in class P. In our view, this hardness result, which is actually for only the ordered DFS tree 
problem, has proved to be quite discouraging for the researchers working in the area of dynamic algorithms. 


2 


This is evident from the fact that for all the static graph problems that were solved using DFS traversal in 
the 1970’s, none of their dynamic counterparts used a dynamic DFS tree ll^lZTlI^I^ IOl fTOlfT^ . 

Apart from the hardness of the ordered DFS tree problem in dynamic environment, very little progress 
has been achieved even for the problem of maintaining any DFS tree. Franciosa et al. ifT^ designed an 
incremental algorithm for a DFS tree in a DAG. For any arbitrary sequence of edge insertions, this algorithm 
takes 0{mn) total time to maintain a DFS tree from a given source. Recently Baswana and Choudhary 0 
designed a decremental algorithm for a DFS tree in DAG that requires expected 0{mn log n) total time. For 
undirected graphs, recently Baswana and Khan |!5l| designed an 0{n‘^) total time incremental algorithm for 
maintaining a DFS tree. These algorithms are the only results known for the dynamic DFS tree problem. 
Moreover, none of these existing algorithms, though designed for only a partially dynamic environment, 
achieves a worst case bound of o{m) on the update time. So the following intriguing questions remained 
unanswered till date: 

• Does there exist any fully dynamic algorithm for maintaining a DFS tree? 

• Is it possible to achieve worst case o{m) update time for maintaining a DFS tree in a dynamic envi¬ 
ronment? 

Not only do we answer these open questions affirmatively for undirected graphs, we also use our dy¬ 
namic algorithm for DFS tree to provide efficient solutions for a couple of well studied dynamic graph 
problems. Moreover, our results also handle vertex updates which are generally considered harder than edge 
updates. 

1.2 Our results 

We consider a generalized notion of updates wherein an update could be either insertion/deletion of a vertex 
or insertion/deletion of an edge. For any set U of such updates, let G -|- 17 denote the graph obtained after 
performing the updates U on the graph G. Our main result can be succinctly described in the following 
theorem. 

Theorem 1.1 An undirected graph can be preprocessed to build a data structure of 0{m\ogn) size such 
that for any setUofk < n updates, a DFS tree ofG + U can be reported in 0{nk log"^ n) time. 

With this result at the core, we easily obtain the following results for dynamic DFS tree in an undirected 
graph. 

1. Fault Tolerant DFS tree: 

Given any set of k failed vertices or edges and any vertex v £ V, v/e can report a DFS tree rooted at 
V for the resulting graph in 0{nk log^ n) time. 

2. Fully Dynamic DFS tree: 

Given any arbitrary online sequence of vertex or edge updates, we can maintain a DFS tree in 
0{^mn log^'^ n) worst case time per update. 

3. Incremental DFS tree: 

Given any arbitrary online sequence of edge insertions, we can maintain a DFS tree in 0(n log^ n) 
worst case time per edge insertion. 

These are the first o{m) worst case update time algorithms for maintaining a DFS tree in a dynamic 
environment. Recently, there has been significant work (I] |23l on establishing conditional lower bounds on 
the time complexity of various dynamic graph problems. A simple reduction from [1], based on the Strong 
Exponential Time Hypothesis (SETH), implies a conditional lower bound of D(n) on the update time of any 
fully dynamic algorithm for a DES tree under vertex updates. We also present an unconditional lower bound 
of D(n) for maintaining a fully dynamic DES tree explicitly under edge updates. 
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1.3 Applications of Fully Dynamic DFS 

In the static setting, a DFS tree can be easily used to answer connectivity, 2-edge connectivity and bicon¬ 
nectivity queries. Our fully dynamic algorithm for DFS bee thus seamlessly solves these problems for both 
vertex and edge updates. Further, our result gives the first deterministic algorithm with 0(1) query time 
and o{m) worst case update time for several well studied variants of these problems in the dynamic setting. 
These problems include dynamic subgraph connectivity ifTOl IT^ flTl [191 l27l [33 and vertex update versions 
of dynamic biconnectivity ll25l l24l l27l and dynamic 2-edge connectivity 1271 ITTl [T9l . The existing results 
offer different trade-offs between the update time and the query time, and differ on the types (amortized or 
worst case) of update time and the types (deterministic or randomized) of query time. Our algorithm, in 
particular, improves the deterministic worst case bounds for these problems, thus emphasizing the relevance 
of DFS trees in solving dynamic graph problems. 

1.4 Main Idea 

Let T be a DFS tree of G. To compute a DFS tree of G -|- (7 for a given set U of updates, the main idea is 
to make use of the original tree T itself. We preprocess the graph G using tree T to build a data structure V. 
In order to achieve o(m) update time, our algorithm makes use of V to create a reduced adjacency list for 
each vertex such that performing DFS traversal using these lists gives a DFS tree for G + U.\n fact, these 
reduced adjacency lists are generated on the fly and are guaranteed to have only 0{n\U\) edges. 

We now give an outline of the paper. In section|^ we describe the various notations used throughout the 
paper. Sectionj^describes an algorithm to report the DFS tree after a single update in the graph. The details 
of the required data structure V are described in Section Then in Section we provide an overview of 
our algorithm for handling multiple updates, highlighting the main intuition behind our approach. Our main 
result (Theorem I l.lj ) that reports a DFS tree after any set of updates in the graph is described in Section]^ 
In Section]^ we convert this algorithm to fully dynamic and incremental algorithms for maintaining a DFS 
tree using the overlapped periodic rebuilding technique. Finally, in Section]^ and Section [T^ we describe 
the applications and lower bounds of dynamic DFS. 

2 Preliminaries 

Let U be any given set of updates. We add a dummy vertex r to the given graph in the beginning and connect 
it to all the vertices. Our algorithm starts with any arbitrary DFS tree T rooted at r in the augmented graph 
and it maintains a DFS tree rooted at r at each stage. It can be observed easily that each subtree rooted at 
any child of r is a DFS tree of a connected component of the graph G + U. The following notations will be 
used throughout the paper. 

• T{x) : The subtree of T rooted at vertex x. 

• path{x, y) : Path from the vertex x to the vertex y in T. 

• distxix, y) : The number of edges on the path from x to y in T. 

• LGA{x, y) : The lowest common ancestor of x and y in tree T. 

• N{w) : The adjacency list of vertex w in the graph G + U. 

• L{w) : The reduced adjacency list of vertex w in the graph G + U. 

• T* : The DFS tree rooted at r computed by our algorithm for the graph G + U. 

• par{w) : Parent of m in T*. 
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A subtree T' is said to be hanging from a path p if the root r' of T' is a child of some vertex on the 
path p and r' does not belong to the path p. Unless stated otherwise, every reference to a path refers to an 
ancestor-descendant path defined as follows: 

Definition 2.1 (Ancestor-descendant path) A path p in a DFS tree T is said to be ancestor-descendant 
path if its endpoints have ancestor-descendant relationship in T. 

We now state the operations supported by the data structure V (complete details of V are in Section 
1^. Let U below refer to a set of updates that consists of vertex and edge deletions only. For any three 
vertices w,x,y G T, where path{x, y) is an ancestor-descendant path in T the following two queries can 
be answered using V in 0(log^ n) time. 

1. Query{w, x, y) : among all the edges from w that are incident on path{x, y)mG-\- U, return an edge 
that is incident nearest to x on path{x, y). 

2. Query{T{w), x, y) : among all the edges from T{w) that are incident on path{x, y)mG -\- U, return 
an edge that is incident nearest to x on path{x, y). 


3 Handling a single update 

DFS traversal has the following flexibility : when the traversal reaches a vertex, say v, the next vertex to 
be traversed can be any unvisited neighbor of v. In order to compute a DFS tree for G -|- (7 efficiently, our 
algorithm exploits this flexibility, the original DFS tree T, and the following property of DFS traversal. 



Figure 1: Edges e'^ as well as can be ignored during the DFS traversal. 


Lemma 3.1 (Components Property) Let T* be the partially grown DFS tree and v be the vertex currently 
being visited. Let G be any connected component in the subgraph induced by the unvisited vertices. Suppose 
two edges e and e' from G are incident respectively on v and some ancestor (not necessarily proper) w of v 
in T*. Then it is sufficient to consider only e during the rest of the DFS traversal, i.e., the edge F need not 
be scanned. (Refer to Figure^. 

Skipping e' during the DFS traversal, as stated in the components property, is justified because e' will 
appear as a back edge in the resulting DFS tree. In order to highlight the importance of components property, 
and to motivate the requirement of data structure D, we first consider a simpler case which deals with 
reporting the DFS tree after a single update in the graph. 

Consider the failure of a single edge (6, /) (refer to Figure (i)). Exploiting the flexibility of DES 
traversal, we can assume a stage in the DES traversal of G\{(6, /)} where the partial DES tree T* is T\T{f ) 
and the vertex b is currently being visited. Thus, the unvisited graph is a single connected component 
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containing the vertices of T(/). Now, according to components property we need to process only the lowest 
edge from T(/) to path{b, r) {{k, b) in Figure|^(ii)). Hence, the DFS traversal enters this component using 
the edge {k, b) and performs DFS traversal in the subgraph induced by the vertices of T(/). The resulting 
DFS tree of this subgraph would now be rooted on k. Rebuilding the DFS tree after failure of the edge (6, /) 
thus reduces to finding the lowest edge from T{f) to path{e, r) and then rerooting a subtree T{f) of T on 
the new root k. We now describe how this rerooting can be performed in 0{n) time in the following section. 



Figure 2: (i) Failure of edge (6, /). (ii) Partial DFS tree T* with unvisited graph T(/), component property 
allows us to neglect (a,/), (iii) Augmented path{k,f) to T*, components property allows us to neglect 
{I, k). (iv) Final DFS tree of G\{{b, /)}. 


3.1 Rerooting a DFS tree 

Given a DFS tree T originally rooted at ro and a vertex r', the aim is to compute a DFS tree of the graph 
that is rooted at r'. Note that any subtree T{x) of the DFS tree T is essentially the DFS tree of the subgraph 
induced by the vertices of T{x). Hence, the same procedure can be applied to reroot a subtree T{x) of the 
DFS tree T. Thus in general we aim at rerooting T(ro) to a new root r' (see Figure|^(ii), where the subtree 
T(/) would be rerooted to the new root k). 


Procedure Reroot(T(ro), r'): Reroots the subtree T{ro) of T to be rooted at the vertex r'. 

1 foreach 

[a,b) on path{rQ,r') do /* a = par{b) in original tree T{rQ). */ 

2 

par{a) ■<— 6; 

3 

foreach child cofb not on path{ro, r') do 

4 


{u,v) Query(T{c),rQ,b) /* where u £ path{rQ,r') and vGT{c). */ 

5 


if {u, v) is non-null then 

6 



Reroot {T{c),v)\ 

7 



par{v) £- w, 

8 


end 

9 

end 


10 end 




Figure 3: The recursive algorithm to reroot a DFS tree T(ro) from the new root r'. 


Our algorithm (refer to Procedure Reroot I essentially guides the DFS traversal (exploiting the flexibility 
of DFS) such that components of the unvisited graph can be easily identified. The components property can 
then be applied to each such component, processing only 0(n) edges to compute the rerooted DFS tree. 
The DFS traversal first visits the path from r' to the root of tree T{ro). This reverses path{ro, r') in the 
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new DFS tree T* as now r' would be an ancestor of tq (see Figure (iii)). Now, each subtree hanging 
from path{r', ro) in T forms a component of the unvisited graph. This is because, presence of any edge 
between these subtrees would imply a cross edge in the original DFS tree. Using the components property 
we know that for each of these subtrees (say Tj) we only need to process the lowest edge from Tj on the 
new path from r' to tq in T*. Since path{r', ro) is reversed in T*, it is equivalent to processing the highest 
edge ei from Ti to the path{ro, r') in T. Recall that this query can be answered by our data structure V in 
0(log^ n) time (refer to Section^. Now, let Vi be the end vertex of in Tj. The DFS traversal will thus 
visit the component (having vertices of Ti) through a and produce a DFS tree of the subgraph induced by 
Ti that is rooted on Vi. This rerooting can be performed by invoking the rerooting procedure recursively on 
the subtree T with the new root Vi. 


We now analyze the total time required by Procedure Reroot to reroot a subtree T' of the DFS tree T. 
The total time taken by our algorithm is proportional to the number of edges processed by the algorithm. 
These edges include the tree edges that were a part of the original tree T' and the added edges that are 
returned by the data structure T>. Clearly, the number of tree edges in T' are 0(|T'|). Also since the added 
edges eventually become a part of the new DFS tree T*, they too are bounded by the size of the tree T'. 
Further, the data structure V takes 0(log^ n) time to report each added edge. Hence the total time taken 
by our algorithm to rebuild T' is 0{\T'\ log^ n) time. Since V can be built in 0(mlog n) time (refer to 
Theorem |4.1 1 in the Section]^, we have the following theorem. 

Theorem 3.1 An undirected graph can be preprocessed to build a data structure in 0(m log n) time, such 
that any subtree T' of the DFS tree can be rerooted at any vertex in T' in 0{\T'\ log^ n) time. 


We now formally describe how rebuilding a DFS tree after an update can be reduced to this simple 
rerooting procedure (see Figure]^. 

1. Deletion of an edge {u,v)\ 

In case {u, v) is a back edge in T, simply delete it from the graph. Otherwise, let u = par{v) in 
T. The algorithm finds the lowest edge («', v') on the path{u, r) from T{v), where v' G T{v). The 
subtree T{v) is then rerooted to the new root v' and hanged from u' using [u', v') in the final tree T*. 

2. Insertion of an edge (tt, v)\ 

In case (tt, v) is a back edge, simply insert it in the graph. Otherwise, let w be the LCA of u and v in 
T and v' be the child of w such that v G T{v'). The subtree T{v') is then rerooted to the new root v 
and hanged from u using {u, v) in the final tree T*. 


3. Deletion of a vertex u: 

Let vi, ...,Vcb>e. the children of u in T. For each subtree T{vi), the algorithm finds the lowest edge 
(u', v'f) on the path{par{u),r) from T{vi), where v[ G T{vi). Each subtree T{vi) is then rerooted to 
the new root v[ and hanged from u[ using {u[, v[) in the final tree T*. 


4. Insertion of a vertex u: 

Let ui,..., tic be the neighbors of u in the graph. Make u a child of some Vj in T*. For each Vi, such 
that Vi path{vj,r), let T{v[) be the subtree hanging from path{vj,r) such that Vi G T{v'fj. Each 
subtree T(ti') is then rerooted to the new root Vi and hanged from u using (ti, Vi) in the final tree T*. 


In case of vertex updates, multiple subtrees may be rerooted by the algorithm. Let these subtrees be 
Ti, ...jTc. Thus, the total time taken by our algorithm is equal to the time taken to reroot the subtrees 
Ti, ...jTc. Using Theorem 3.1 we know that a subtree T' can be rerooted in 0(|T'|) time. Since these 
subtrees are disjoint, the total time taken by our algorithm to build the resulting DES tree is 0(|Ti| + ... + 
I Tel) = 0{n). Thus we have the following theorem. 


Theorem 3.2 An undirected graph can be preprocessed to build a data structure in 0{mlogn) time such 
that after a single update in the graph, the DFS tree can be reported in 0{n log^ n) time. 
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(i) 





(iv) 


Figure 4: Updating the DFS tree after a single update: (i) deletion of an edge, (ii) insertion of an edge, (iii) 
deletion of a vertex, and (iv) insertion of a vertex. The algorithm reroots the marked subtrees (marked in 
violet) and hangs it from the inserted edge (in case of insertion) or the lowest edge (in case of deletion) on 
the marked path (marked in blue) from the marked subtree. 


4 Data Structure 

The efficiency of our algorithm relies heavily on the data structure V. For any three vertices w,x,y G T, 
where path{x, y) is an ancestor-descendant path in T, we need to answer the following two kinds of queries. 

1. Query{w, x, y) : among all the edges from w that are incident on path{x, y) inG + U, return an edge 
that is incident nearest to x on path{x, y). 

2. Query{T{w), x, y) : among all the edges from T{w) that are incident on path{x, y)mG + U, return 
an edge that is incident nearest to x on path{x, y). 

We now describe construction of the data structure V. It employs a combination of two well known 
techniques, namely, heavy-light decomposition ll^ and suitable augmentation of a binary tree (segment 
tree) as follows. 

1. Perform a pre-order traversal of the tree T with the following restriction: Upon visiting a vertex 
V £ T, the child of v that is visited first is the one storing the largest subtree. Let C be the list of 
vertices ordered by this traversal. 

2. Build a segment tree Tb whose leaf nodes from left to right represent the vertices in list C. 

3. Augment each node z oiTs with a binary search tree £{z), storing all the edges {u, v) £ E where u 
is a leaf node in the subtree rooted at z in 7 b . These edges are sorted according to the position of the 
second endpoint in C. 

The construction of V described above ensures the following properties which are helpful in answering 
a query Query{T{w),x, y) (see Figure]^. 

• T{w) is present as an interval of vertices in £ (by step 1). Moreover, this interval can be expressed as 
a union of O(logn) disjoint subtrees in Tb (by step 2). Let these subtrees be Ti,..., Tg. 

• It follows from the heavy-light decomposition used in step 1 that path path{x, y) can be divided into 
O(logn) subpaths path{xi,yi),... ,path{xe, yt) such that each subpath path{xi, yi) is an interval 
in C. 
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DFS tree T 



Figure 5: (i) The highest edge from subtree T{w) on path{x, y) is edge (x, s) and the lowest edges are edge 
(z, w) and {z, t). (ii) The vertices of r(tt;) are represented as union of two subtrees in segment tree Tg. 


• Let Zj be the root of subtree Tj in Tg. Then it follows from step 3 that any query Query{Tj,Xi, yi) 
can be answered by a single predecessor or successor query on the BST £{zj) in O(logn) time. 

To answer Query(T{w),x, y), we thus find the edge closest to x among all the edges reported by the 
queries {Query{Tj, Xi,yi)\l < j < q and 1 < f < ^}. Thus, Query{T{w),x,y) can be answered in 
0(log^ n) time. Notice that Query{w, x, y) can be considered as a special case where q = 1 and Ti is the 
leaf node of Tb representing w. The space required by T> is 0(m log n) as each edge is stored at O(logre) 
levels in Tb- Now, the segment tree Tb can be built in linear time. Further, for every node u, the sorted list 
of edges in £{u) can be computed in linear time by merging the sorted lists of its children. Thus, the binary 
search tree £{u) for each node u £ Tb can be built in time linear in the number of edges m£{u). Hence the 
total time required to build this data structure is 0{m log n). Thus, we have the following theorem. 

Theorem 4.1 The queries Query{T{w),x, y), Query{w, x, y) on T can be answered in 0(log^ n) worst 
case time using a data structure T) of size 0{m log n), which can be build in 0{m log n) time. 

Note: Our algorithm also requires deletion of edges from V. An edge can be deleted from V by deleting 
the edge from the binary search tree stored at its end points and their ancestors in Tb- Since a deletion in 
binary search tree takes 0(log n) time, an edge can be deleted from V in 0(log^ n) time. 


5 Handling multiple updates - Overview 


DFS tree can be computed in 0{n) time after a single update in the graph, by reducing it to Procedure 


Reroot However, the same procedure cannot be directly applied to handle a sequence of updates because of 
the following reason. The efficiency of Procedure |Reroot| crucially depends on the data structure V, which 
is build using the DFS tree T of the original graph. Thus, when the DFS tree is updated, we are required to 
rebuild V for the updated tree. Now, rebuilding V is highly inefficient because it requires 0{m log n) time. 
Thus, in order to handle a sequence of updates, our aim is to use the same V for handling multiple updates, 
without having to rebuild it after every update. We now give an overview of the algorithm that reports the 
DFS tree after a set U of updates. 

In case of single update, all the edges reported by V are added to the final DFS free T*. However, while 
handling multiple updates, we use V to build reduced adjacency list for vertices of the graph, such that the 
DFS traversal of the graph using these sparser lists gives the DFS tree of the updated graph. Now, the data 
structure V finds the lowest/highest edge from a subtree of T to an ancestor-descendant path of T. Thus, in 
order to employ V to report DFS tree of G + U, we need to ensure that the queried subtrees and paths do 
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not contain any failed edges or vertices from U. Hence, for a set U of updates, we compute a partitioning of 
T into a disjoint collection of ancestor-descendant paths and subtrees such that none of these subtrees and 
paths contain any failed edge or vertex. An important property of this partitioning is that there are no edges 
from G lying between any two subtrees in this partitioning. We refer to this partitioning as a disjoint tree 
partitioning. Note that this partitioning depends upon only the vertex and edge failures in the set U. 

Recall that during the DFS traversal we need to find the lowest edge from each component C of the 
unvisited graph. It turns out that any component C can be represented as a union of subtrees and ancestor- 
descendant paths of the original DFS tree T. The components property can now be employed to compute 
the reduced adjacency lists of the vertices of the graph as follows. We just find fhe lowesf edge from each 
of fhe subfrees and the ancestor-descendant paths to T* by querying the data structure V. Let this edge 
be (x, y) where x G T* and y £ C. We can just add y to the reduced adjacency list L{x) of x. Since 
components property ensures the remaining edges to T* can be ignored, the DFS traversal would thus 
consider all possible candidates for the lowest edge from every component C to T*. Let the initial disjoint 
tree partitioning consists of a set of ancestor-descendant paths V and a set of subtrees T. The algorithm for 
computing a DFS tree of G + U can be summarized as follows: 

Perform the static DFS traversal on the graph with the elements ofVUT as the super vertices. Visiting a 
super vertex v* by the algorithm involves extracting an ancestor-descendant path pQfrom v* and attaching it 
to the partially grown DFS tree T*. The remaining part ofv* is added back to VUT as new super vertices. 
Thereafter, the reduced adjacency lists of the vertices on path po are computed using the data structure D. 
The algorithm then continues to find the next super vertex using the reduced adjacency lists and so on. 

6 Disjoint Tree Partitioning 

We formally define disjoinf free parfifioning as follows. 

Definition 6.1 Given a DFS tree T of an undirected graph G and a set U of failed vertices and edges, let 
Abe a vertex set in G -\-U. The disjoint tree partitioning defined by A is a partition of the subgraph of T 
induced by A into 

1. Aset of paths V such that (i) each path in V is an ancestor-descendant path in T and does not contain 
any deleted edge or vertex, and(ii) \V\ < \U\. 

2. A set of trees T such that each tree t £ T is a subtree ofT which does not contain any deleted edge 
or vertex. 

Note that for any ti,T 2 G T, there is no edge between ti and T 2 because T is a DFS tree. 

The disjoinf free parfifioning for sef A = H\{r} can be compufed as follows. Lef Vf wAEj respectively 
denofe fhe sef of failed verfices and edges associafed with the updates U. We initialize V = f and T = 
{T{w) I m is a child of r}. We refine fhe parfifioning by processing each vertex x G V/ as follows (see 
Figure [^(i)). 

• If x is presenf in some T' G T, we add fhe pafh from par{v) fo fhe roof of T' fo V. We remove T' 
from T and add all fhe subfrees hanging from fhis pafh fo T. 

• If x is presenf in some pafh p G "P, we splif p aiv info fwo pafhs. We remove p from V and add fhese 
fwo pafhs fo V. 

Edge deletions are handled as follows. We firsl remove edges from Ej fhaf donT appear in T. Processing 
of the remaining edges from Ej is quite similar to the processing of Vf as described above. For each edge 
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Figure 6: Disjoint tree partition for V \ {r}: (i) Initializing T = {T(a),T(h)} and V = %, (ii) Disjoint 
tree partition obtained after deleting the vertex g. (ii) Final disjoint tree partition obtained after deleting the 
edges (c, d) and (m, n). 


e G Ef \ just visualize deleting an imaginary vertex lying at mid-point of the edge e (see Figure [^(ii)). It 
takes 0(n) time to process any v ^ Vf and e £ Ef. 

Note that each update can add at most one path to V. So the size of V is bounded by \U\. The fact that 
T is a DFS tree of G ensures that no two subtrees in T will have an edge between them. SoV UT satisfies 
all the conditions stated in Definition l6.ll 

Lemma 6.1 Given an undirected graph G with a DFS tree T and a set U of failing vertices and edges, we 
can find a disjoint tree partition of set V \ {r} in 0{n\U\) time. 


7 Fault tolerant DFS Tree 


We first present a fault tolerant algorithm for a DFS tree. Let (7 be a given set of failed vertices or edges in 
G. In order to compute the DFS tree T* for G + U, our algorithm first constructs a disjoint tree partition 
{T,V) for L\{r-} defined by the updates U (see Lemma 6.11. Thereafter, it can be visualized as the static 
DFS traversal on the graph whose (super) vertices are the elements ofVuT. Note that our notion of super 
vertices is for the sake of understanding only. 

Consider the stack-based implementation of the static algorithm for computing a DFS tree rooted at a 
vertex r in graph G (refer to Figure|^i)). Our algorithm for computing DFS tree for G + U (refer to Figure 
l^ii)) is quite similar to the static algorithm. The only points of difference are the following. 


• In the static DFS algorithm whenever a vertex is visited, it is attached to the DFS tree and pushed into 
the stack S. In our algorithm when a vertex u in some super vertex Vs G "P U T is visited, a path 
starting from u is extracted from Vg and attached to the DFS tree, and this entire path is pushed into 
the stack S. 


• Instead of scanning the entire adjacency list N(w) of a vertex w, the reduced adjacency list L(w) is 
scanned. 


When a path is extracted from a super vertex Vg, the remaining unvisited part of Vg is added back 
toT U P. However, we need to ensure that the properties of disjoint tree partitioning are satisfied in the 
updated PUP. This is achieved using Procedure DFS-in-Path and Procedure DFS-in-Tree, which also build 
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Procedure Static-DFS(G, r): Static algo¬ 
rithm to compute a DFS tree of G rooted at 


r. 

1 Stack 5^0; 

2 Push{r); 

3 status{r) ^ visited; 

4 while S 7 ^ empty do 

5 

6 

7 

8 
9 

10 


11 

12 

13 

14 

15 


w ^ Top{S); 

if N{w) = 0 then Pop{w); 
else 

u -s— First vertex in N{w); 
Remove u from N{w); 
if status{u) = unvisited then 


w; 


par{u) 
status{u) e- visited; 
Push{u); 


end 
end 


16 end 


Procedure Dynamic-DFS(G, C/, r): Algorithm 
for updating the DFS tree T rooted at r for the 
graph G PU. 


1 Stack 5^0; {T,V) 

2 Push{r); 

3 status{r) -ir- visited; L{r) 

4 while S 7 ^ empty do 


PartitioniT, U); 
N{r); 


5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 


w Top{S); uq w; 
if L{w) = 0 then Pop{w); 
else 

u e- First vertex in L{w); 

Remove u from L{w); 
if status {u) = unvisited then 
if iNFO(u) = tree then 

^ DFS-in-Tree(u); 
else if iNFO(u) = path then 

DFS-in-Path(u); 

end 

for 1 = 1 to t do 

par{ui) ^Ui-i; 
status{ui) ^ visited; 
Push{ui); 

end 
end 
end 


23 end 


(i) 


(ii) 


Figure 7: The static (and dynamic) algorithm for computing (updating) a DFS tree. The key differences are 
shown in blue. 


the reduced adjacency list for the vertices on the path. The construction of a sparse reduced adjacency list is 
inspired by Lemma [3d] (Component property) which can be reformulated in the context of our algorithm as 
follows. 

Property 7.1 When a path p is attached to the partially constructed DFS tree T* during the algorithm, for 
every edge {x, y), where x € p and y belongs to the unvisited graph the following condition holds. Either 
y is added to L{x) or y' is added to L{x') for some edge {x', y') where x' is a descendant (not necessarily 
proper) of x in p and y' is connected to y in the unvisited graph. 

We now describe how the properties of disjoint tree partitioning and hence Property [7.1| are maintained 
by our algorithm when a vertex v £ Vg is visited by the traversal. 

1. Let Vs = path{x, y) £ V. Exploiting the flexibility of DFS, we traverse from v to the farther end of 
path{x, y). Now path{x, y) is removed from V and the untraversed part of path{x, y) (with length 
at most half of \path{x, y)|) is added back to V. We refer to this as path halving. This technique 
was also used by Aggarwal and Anderson O in their parallel algorithm for computing DFS tree in 
undirected graphs. Notice that \ V\ remains unchanged or decreases by 1 after this step. 

2. Let Vs = T £ T. Exploiting the flexibility of a DES traversal, we traverse the path from v to the root 
of r (say x) and add it to T*. Thereafter, r is removed from T and all the subtrees hanging from this 
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r 



Figure 8: Visiting a super vertex from TUV. (i) The algorithm visits T{a) £ T using the edge (r, e) and 
the path{n, t) £V using the edge (r, q). (ii) Traversal extracts path{e, a) and path{q, n) and augment it to 
T*. The unvisited segments are added back to T and V. 


path are added to T. Observe that every newly added subtree is also a subtree of the original DFS tree 
T. So the properties of disjoint tree partitioning are satisfied after this step as well. 

Let path{v, x) be the path extracted from Vg- For each vertex w in this newly added path, we compute 
L{w) ensuring Property [7.1| as follows. 

(i) For each path p £ V, among potentially many edges incident on w from p, we just add any one edge. 

(ii) For each tree r' £ T, we add at most one edge to L as follows. Among all edges incident on r' from 
path{v, x), if (u), z) is the edge such that w is nearest to x on path{v, x), then we add z to L{w). 
However, for the case Vg £ T, we have to consider only the newly added subtrees in T for this step. 
This is because the disjoint tree partitioning ensures the absence of edges between Vg and any other 
tree in T. 

Figure [^provides an illustration of how 7” U "P is updated when a super vertex in T U P is visited. 

7.1 Implementation of our Algorithm 

We now describe our algorithm in full detail. Firstly we delete all the failed edges in U from the data 
structure V. Now, the algorithm begins with a disjoint tree partition (P, P) which evolves as the algorithm 
proceeds. The state of any unvisited vertex in this partition is captured by the following three variables. 
-info(m): this variable is set to tree if u belongs to a tree in T, and set to path otherwise 
-ISROOT(n): this variable is set to True if v is the root of a tree in T, and False otherwise. 
-PATHPARAM(r;): if v belongs to some path, say path{x, y), in P, then this variable stores the pair (x, y), 
and null otherwise. 

Procedure Dynamic-DFS : For each vertex v, status{v) is initially set as unvisited, and L{v) is initialized 
to 0. First a disjoint tree partition is computed for the DFS tree T based on the updates U. The procedure 
Dynamic-DFS then inserts the root vertex r into the stack S. Now while the stack is non-empty, the pro¬ 
cedure repeats the following steps. It reads the top vertex from the stack. Let this vertex be w. If L{w) is 
empty then w is popped out from the stack, else let u be the first vertex in L{w). If vertex u is unvisited till 
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Procedure DFS-in-Tree(M): DFS traversal 
enters from node u and exits from v, the root 
of the tree containing node u in set T. 


1 •(— m; 

2 while IsRoOT(ti) ^ True do 

3 I u par{v) 

4 end 


5 IsRooT(ti) t- False-, 

6 T^r\T{v)-, 


7 

8 
9 

10 

11 

12 

13 

14 


{wi ,... ,uit) t- path{u,v)-, 
for i = 1 to t do 

foreach path{x, y) £ V do 
if Query{u)i,x,y) 7 ^ 0 then 
{w^,z) t- Query{wi,x,y)-, 
L{wi) ^ L{wi) U {z}-, 
end 
end 


15 

16 

17 

18 

19 

20 


foreach child w of Wi except Wi-i do 
{y,z) ^ Query{T{w),v,u)-, 

/* where y £ path{u,v) 

L{y) ^ L{y) U {z}-, 

T^TUT{w)-, 

IsRoot(z(;) t- True; 
end 


21 end 

22 Return pat/i(u, v); 


*/ 


Procedure DFS-in-Path(M): DFS traversal enters 
from node u and exits from v, the farther end of 
path containing node u in set V. 

1 (u, d) t-P athParam(u); 

2 if distT{u,d) > distT{u,v) then Swap(v,d)-, 

3 c Neighbor of u on path{v, d) nearer to d; 

4 "P {V\path{v,d))Upath{c,d)-, 

5 for c' G path(e, d) do 

6 I PathParam(c') t-(c, d); 

7 end 


8 

9 

10 

11 

12 

13 

14 

15 

16 


(wi,... ,Wt) t- path{u,v)-, 
for i = 1 to t do 

foreach path{x, y) £ V do 
if Query{wi,x,y) 7 ^ 0 then 
{wi,z) t- Query{u)i,x,y)-, 
L(wi) t- L{wi) U {z}-, 
end 
end 
end 


17 

18 

19 

20 
21 


foreach T{w) G T do 

if Query{T{w),v,u) 7^ 0 then 
{y,z) ^ Query{T{w),v,u)-, 
/* where y G path{u, v) 

L{y) ^ L{y) u {z}-, 

end 


22 end 

23 Return poth(u, u); 


*/ 


Figure 9: The pseudo-eode of Proeedures DFS-in-Tree and Proeedures DFS-in-Path. 


now, then depending upon whether u £ T or u £ V, Proeedure DFS-in-Tree or DFS-in-Path is exeeuted. 
A path po is then returned to Proeedure Dynamie-DFS where for eaeh vertex of po parent is assigned and 
status is marked visited. The whole of this path is then pushed into staek. The proeedure proeeeds to the 
next iteration of While loop with the updated staek. 

Procedure DFS-in-Tree : Let vertex u is present in tree, say T{v), in T (the vertex v ean be found easily 
by seanning the aneestors of u and eheeking their value of IsROOT). The DFS traversal enters the tree from 
u and leaves from the vertex v. Let path{u, v) = {wi = u,W 2 ■ ■ ■ ,wt = v). The path{u, v) is pushed into 
staek and attaehed to the partially eonstrueted DFS tree T*. We now update the partition ('P, T) and also 
update the redueed adjaeeney list for eaeh Wi present on path{u, v) as follows. 

1. For eaeh vertex Wi and every path path{x, y) £ V, we perform Query{wi,x, y) on the data strueture 
V that returns an edge {wi, z) sueh that z G path{x, y). We add 2: to L{wi). 

2. Reeall that sinee subtrees in T do not have any eross edge between them, therefore, there eannot be 
any edge ineident on path{u, v) from trees whieh are already present in T. An edge ean be ineident 
only from the subtrees whieh was hanging from path{u, v). T{v) is removed from T and all the 
subtrees of T{v) hanging from path{u, v) are inserted into T. For eaeh sueh subtree, say r, inserted 
into T, we perform Query{T, u, v) on the data strueture V that returns an edge, say {y, z), sueh that 
z £ T and y is nearest to u on path{u, v). We insert 2; into L{y). 
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Procedure DFS-in-Path : Let vertex u visited by the DFS traversal lies on a path{v, y) G V. Assume 
distT{u, v) > distT{u, y). The DFS traversal travels from u to u (the farther end of the path). The path 
path{v,y) in set V is replaced by its subpath that remains unvisited. The reduced adjacency list of each 
w G path{u, v) is updated in similar way as in the procedure DFS-in-Tree except that in step 2, we perform 
Query{T, u, v) for each t e T. 

The reader may refer to Figure]^ for pseudo code of Procedures DFS-in-Tree and DFS-in-Path. This 
completes the description of the fault tolerant algorithm for DFS tree. This algorithm maintains Property 


7.1 at each stage by construction given that the properties of disjoint tree partitioning are satisfied. 


7.2 Correctness 

It can be seen that the following two invariants hold for the while loop in the Procedure Static-DFS described 
in Figure|^(i). It is easy to see that these invariants imply the correctness of the algorithm, i.e., the generated 
tree is a rooted spanning tree where every non-tree edge is a back edge. 

Ii: The sequence of vertices in the stack from bottom to top constitutes an ancestor-descendant path from 
r in the DFS tree computed. 

I 2 : For each vertex v that is popped out, all vertices in the set N(v) have already been visited. 

These two invariants Ii and I 2 also hold for Procedure Dynamic-DFS described in Figure (ii) as 
follows. Invariant Ii holds by construction as described in our algorithm. Following lemma proves that 
invariant I 2 is maintained by our algorithm since it follows Property [7.1| by construction. 

Lemma 7.1 If Property \7.1\ is maintained by the procedure Dynamic-DFS, then invariant I 2 will hold true 
at each stage of the algorithm. 

Proof: We give a proof by contradiction as follows. Assume that x is the first vertex that is popped out of 
the stack before some vertex y £ N{x) is visited. Consider the time when a path p containing x was pushed 
in the stack. Clearly y ^ L{x), hence using Property [7.l| we know that some y' G L{x') is connected to y in 
the unvisited graph where x' is a descendant (not necessarily proper) of x in p. Let p* be a path between y' 
and y in the unvisited graph. 

Now consider the time when x is popped out of the stack. Clearly all its descendants have been popped 
out, so y' has been visited by the traversal. Thus, p* can be divided into two non-empty sets A and B 
denoting visited and 

unvisited vertices of p* respectively. Here y' £ A and y £ B, thus clearly for some vertex in A invariant I 2 
is not satisfied. This confradicfs our assumpfion fhat x is fhe firsf verfex fhaf is popped ouf of fhe sfack for 
which I 2 is nof safisfied. Thus, mainfenance of Properfy [7 .1 1 ensures fhe invarianf I 2 in our algorifhm. □ 

Hence, our algorifhm indeed compufes a valid DFS free for G U. 


7.3 Time complexity analysis 

As described earlier fhe disjoinf free parfifioning and fhe componenfs properfy play a key role in fhe effi¬ 
ciency of our algorifhm. They allow us fo limif fhe size of fhe reduced adjacency lisfs L, fhaf are builf during 
fhe algorifhm. Our algorifhm compufes T* by performing a DFS fraversal on fhe reduced adjacency lisf L. 
Thus, fhe fime complexify of our algorifhm is 0(n \L\) excluding fhe fime required fo compufe L. 

We firsf esfablish a bound on fhe size of L. In each sfep our algorifhm exfracfs a pafh from Vg £ V UT 
and affaches if fo T*. Lef Pt and Pp denofe fhe sef of such pafhs fhaf originally belonged fo some free in T 
and some pafh in V, respectively. For every pafh pQ £ PtU Pp our algorifhm performs fhe following queries 
on D. 
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(i) For each vertex w in pQ, we query each path in V for an edge incident on the vertex w. Thus, the total 
number of edges added to L by these queries is 0{n\V\). 

(ii) If po belongs to Pp, then we query for an edge from each t £ T to pQ. It follows from the path halving 
technique that each path in V reduces to at most half of its length whenever some path is extracted 
from it and attached to T*. Hence, the size of Pp is bounded by I'Pj log n. 

(iii) If pq belongs to Pt, then we query for an edge from only those subtrees which were hanging from pQ. 
Note that these subtrees will now be added to set T. Hence, the total number of trees queried for this 
case will be bounded by number of trees inserted to T. Since each subtree can be added to T only 
once, these edges are bounded by 0{n) throughout the algorithm. 

Thus, the size of L is bounded by O (n(l + jT* |) log n). Since each edge added to L requires querying the 
data structure V which takes 0(log^ n) time, the total time taken to compute L is 0[n{l-\-\V\ logn) log^ n). 
Thus, we have the following lemma. 


Lemma 7.2 An undirected graph can be preprocessed to build a data structure of 0{m tog n) size such 
that for any set U of k failed vertices or edges (where k < n), the DFS tree of G + U can be reported in 
0(n(l + \V\ log n) log^ n) time. 


From Definition 6.1 we have that |7^| is bounded by \U\. Thus we have the following theorem. 


Theorem 7.1 An undirected graph can be preprocessed to build a data structure of 0{mlogn) size such 
that for any set U of k failed vertices or edges (where k < n), the DFS tree of G + U can be reported in 
0(nk log^ n) time. 


It can be observed that Theorem |7 .1 1 directly implies a data structure for fault tolerant DFS tree. 


7.4 Extending the algorithm to handle insertions 

In order to update the DFS tree, our focus has been to restrict the number of edges that are processed. For 
the case when the updates are deletions only, we have been able to restrict this number to 0(nk log n), for 
a given set of k updates (failure of vertices or edges). We now describe the procedure to handle vertex and 
edge insertions. Let Vj be the set of the vertices inserted, and Ei be the set of edges inserted, (including 
the edges incident to the vertices in Vi). If there are k vertex insertions, the size of Ej is bounded by nk. 
So even if we add all the edges in Ej to the reduced adjacency lists, the size of L would still be bounded by 
0(nk log n). Hence, we perform the following two additional steps before starting the DFS traversal. 

• Initialize L{v) to store the edges in Ej instead of 0. That is, L(v) £- {y \ (y, v) £ Ej} 

• Each newly inserted vertex is treated as a singleton path and added to P. That is, P £- PU{x\x G V/}. 

In order to establish that our algorithm, after incorporating the insertions, correctly computes a DFS tree 
of G + U, we need to ensure that all the edges essential for DFS traversal as described in Property |7.1| are 
added to L. All the essential edges from G are added to L during the algorithm itself. In case an essential 
edge belongs to Ej, the edge has already been added to L during its initialization. Note that the time taken 
by our algorithm remains unchanged since the size of L remains bounded by 0{nk log n). This completes 
the proof of our main result stated in Theorem |1.1| 

Let us consider the case when U consists of edge insertions only. In this case P will be an empty set. 
As discussed above, we initialize the reduced adjacency lists using Ej whose size is equal to \U\. Hence, 
Lemma [7^ implies the following theorem. 

Theorem 7.2 An undirected graph can be preprocessed to build a data structure of 0{mlogn) size such 
that for any set U of k edge insertions (where k < n), the DFS tree of G + U can be reported in 0{n log^ n) 
time. 
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8 Fully dynamic DFS 


We now describe the overlapped periodic rebuilding technique to convert our algorithm for computing a 
DFS tree after k updates to fully dynamic and incremental algorithms for maintaining a DFS tree. Similar 
technique was used by Thorup iHTt for maintaining fully dynamic all pairs shortest paths. 

In the fully dynamic model, we need to report the DFS tree after every update in the graph. Given the 
data structure V built using the DFS tree of the graph G, we are able to report the DFS tree oiG + U after 
\U\ = k updates in 0{nk) time. This becomes inefficient if k becomes large. Rebuilding V after every 
update is also inefficient as it takes 0{m) time to build V. Thus, it is better to rebuild V after every \U'\ = c 
updates for a carefully chosen c. Let V' be the data structure built using the DFS tree of the updated graph 
G + U' with \ U'\ = c. V can thus be used to process the next c updates efficiently (see Figure(a)). The 
cost of building V' can thus be amortized over these c updates. 

To achieve an efficient worst case update time, we divide the building of V' over the first c updates. This 
V is then used by our algorithm in the next c updates, during which a new V” is built in a similar manner 
and so on (see Figure[^(b)). The following lemma describes how this technique can be used in general for 
any dynamic graph problem. For notational convenience we denote any function f{m,n) as /. 

Lemma 8.1 Let D be a data structure that can be used to report the solution of a graph problem after a set 
ofU updates on an input graph G. If D can be build in 0{f) time and the solution for graph G -\-U can be 
reported in 0{h + \U\ x g) time, then D can be used to report the solution after every update in worst case 
0{y/Jg + h) update time, given that ^f jg < n. 


Build 

Vq 

i 

Build 

i 

Build Build 

U2 L)^ 

i i 

Build 

Vq 

i 

Build 

V, 

Build 

V2 

Build 

V3 

Ml 

Uc 

U2c U3c 

Ml 

Uc 

U2c 

U3c 

Use 

Use 

Use Use 


Use 

Use Use 

Vq 

Vi 

'L>2 Vs 


Vo 

Vi 

V2 


(a) (6) 

Figure 10: (a) Fully dynamic algorithm with amortized update time, (b) De-amortization of the algorithm. 

Proof: We first present an algorithm that achieves amortized 0{^/Jg + h) update time. It is based on the 
simple idea of periodic rebuilding. Given the input graph Go we preprocess it to compute the data structure 
Dq over it. Now let ui, ..., Uc (c < n) be the sequence of first c updates on Go- To report the solution after 
update we use Dq to compute the solution for Go + {ui,..., Ui}. This takes 0{h + {i x g)) time. So 
the total time for preprocessing and handling the first c updates is 0{f + Yli=i h + {i x g)). Therefore, the 
average time for the first c updates is 0(//c + c x g + h). Minimizing this quantity over c gives the optimal 
value Co = sj f / g which is bounded by n. So, after every cq updates we rebuild our data structure and use it 
for the next cq updates (see Figure [T0|^a)). Substituting the value of co gives the amortized time complexity 
as 0{yJJg + h ). 

The above algorithm can be de-amortized as follows. Let Gi, G 2 , G 3 ,... be the sequence of graphs 
obtained after cq, 2co, 3co, .. updates. We use the data structure Dq built during preprocessing to handle the 
first 2co updates. Also after the first cq updates we start building the data structure Di over Gi. This Di is 
built in Co steps, thus the extra time spent per update is f /cq = 0{^/Jg) only. We use Di to handle the next 
Co updates on graph G 2 , and also in parallel compute the data structure D 2 over the graph G 2 . (See Figure 
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[TOj^b)). Since the time for building each data structure is now divided in cq steps, we have that the worst case 

update time as 0(-y/75 + ^ )■ D 


The above lemma combined with Theorems o and |7.2| directly implies the following results for the 
fully dynamic DFS tree problem and the incremental DFS tree problem, respectively. 

(For the theorem below we use f = m log n, g = n log^ n and h = 0.) 

Theorem 8.1 There exists a fully dynamic algorithm for maintaining a DFS tree in an undirected graph 
that uses 0(m log n) preprocessing time and can report a DFS tree after each update in the worst case 
0{y/mn log^'^ n) time. An update in the graph can be insertion / deletion of an edge as well as a vertex. 

(For the theorem below we use / = mlogn, g = logn and h = nlog^n, since O(nlog^n) = 
0{k X g + h)fov k < n). 

Theorem 8.2 There exists an incremental algorithm for maintaining a DFS tree in an undirected graph that 
uses 0{m log n) preprocessing time and can report a DFS tree after each edge insertion in the worst case 
0{n log^ n) time. 

9 Applications 

Our fully dynamic algorithm for maintaining a DFS tree can be used to solve various dynamic graph prob¬ 
lems such as dynamic subgraph connectivity, biconnectivity and 2-edge connectivity. Note that these prob¬ 
lems are solved trivially using a DFS tree in the static setting. Let us now describe the importance of our 
result in the light of existing results for these problems. 

9.1 Existing Results 

The dynamic subgraph connectivity problem is defined as follows. Given an undirected graph, the status of 
any vertex can be switched between active and inactive in an update. The goal is to efficiently answer any 
online connectivity query on the subgraph induced by the active vertices. This problem can be solved by 
using dynamic connectivity data structures ifTTlfT^lTTlI^ that answer connectivity queries under an online 
sequence of edge updates. This is because switching the state of a vertex is equivalent to 0{n) edge updates. 
Chan lO introduced this problem and showed that it can be solved more efficiently. He gave an algorithm 
using FMM (fast matrix multiplication) that achieves 0(m° ®^) amortized update time and query 

time. Later Chan et al. ifTOll presented a new algorithm that improves the amortized update time to 
They also mentioned the following among the open problems. 

1. Is it possible to achieve constant query time with worst case sublinear update time ? 

2. Can non trivial updates be obtained for richer queries such as counting the number of connected 
components ? 

Duan IT^ answered the first question affirmatively but at the expense of a much higher update time. 
He presented an algorithm with 0(m^/^) worst case update time and update time, improving the 

worst case bounds for the problem. Kapron et al. |[^ presented a randomized algorithm for fully dynamic 
connectivity which takes 0(1) time per update and answers the query correctly with high probability in 
0(1) time, giving a Monte Carlo algorithm for subgraph connectivity with worst case 0(n) update time. 
Thus their result answered the first question in a randomized setting. However, in the deterministic setting 
both these questions were still open. Our result answers both these questions affirmatively for the deter¬ 
ministic setting as well. Our fully dynamic algorithm directly provides an 0{y/mn) update time and 0(1) 
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query time algorithm for the dynamic subgraph connectivity problem. Our algorithm maintains the number 
of connected components simply as a byproduct. In fact, our fully dynamic algorithm for DFS tree solves 
a generalization of dynamic subgraph connectivity - in addition to just switching the status of vertices, it 
allows insertion of new vertices as well. Hence the existing results offer different trade-offs between the up¬ 
date time and the query time, and differ on the types (amortized or worst case) of update time and the types 
(deterministic or randomized) of query time. Our algorithm, in particular, improves the deterministic worst 
case bounds for the problem (see Figure [TT]). Further, unlike all the previous algorithms for dynamic sub¬ 
graph connectivity, which use heavy machinery of existing dynamic algorithms, our algorithm is arguably 
much simpler and self contained. 
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Figure 11: Current-state-of-the-art of the algorithms for the dynamic subgraph connectivity. 

Exploiting the rich structure of DFS trees, we also obtain 0{^Jmn) update time algorithms for dynamic 
biconnectivity and dynamic 2-edge connectivity under vertex updates in a seamless manner. These problems 
have mainly been studied in the dynamic setting under edges updates only. Some of these results also allow 
insertion and deletion of isolated vertices. Our result, on the other hand does not impose any such restriction 
on insertion or deletion of vertices. Figure illustrates our results and the existing results in the right 
perspective. 
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Figure 12: Current-state-of-the-art of the algorithms for the dynamic biconnectivity (*) and dynamic 2-edge 
connectivity (f) under vertex updates. 

We now describe how our algorithm can be used to solve these problems. 

9.2 Algorithm 

The solution of dynamic subgraph connectivity follows seamlessly from our fully dynamic algorithm as fol¬ 
lows. As mentioned in Section]^ we maintain a DFS tree rooted at a dummy vertex r, such that the subtrees 
hanging from its children corresponds to connected components of the graph. Hence, the connectivity query 
for any two vertices can be answered by comparing their ancestors at depth two (i.e. children of r). This 
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information can be stored for each vertex and updated whenever the DFS tree is updated. Thus, we have a 
data structure for subgraph connectivity with worst case 0{y/mn) update time and 0(1) query time. Our 
algorithm fully dynamic DFS can be extended to solve fully dynamic biconnectivity and 2-edge connectivity 
under both edge updates and vertex updates as follows. 

A set S of vertices in a graph is called a biconnected component if it is maximal set of vertices such 
that on failure of any vertex w in S, the vertices of 5 \ {re} remains connected. Similarly, a set S is said 
to be 2-edge connected component if it is maximal set of vertices such that failure of any edge with both 
end points in S does not disconnect any two vertices in S. The goal is to maintain a data structure that can 
efficiently answer the queries of biconnectivity and 2-edge connectivity in the dynamic setting. 

These queries can be answered easily by finding articulation points and bridges of the graph. It can be 
shown ifT^ that two vertices belong to same biconnected component if and only if the path connecting them 
in a DFS tree of the graph does not passes through an articulation point. Similarly, two vertices belong to 
same 2-edge connected component if and only if the path connecting them in a DFS tree of the graph does 
not have a bridge. The articulation point and bridge in a graph can be defined as follows: 

Definition 9,1 Given a graph G(V, E), a vertex v € V is called an articulation point of G if there exist a 
pair of vertices x,y € V such that every path between x and y in G passes through v. 

Definition 9.2 Given a graph Gif/, E), an edge e £ E is called a bridge ofG if there exist a pair of vertices 
x,y € V such that every path between x and y in G passes through e. 

The articulation points and bridges of a graph can be easily computed by using DFS traversal of the 
graph. Given a DFS tree T of an undirected graph G, we can index the vertices in order they were visited 
by the DFS traversal. This index is called the DFN number of the vertex. The high number of a vertex v is 
defined as the lowest DFN number vertex from which there is an edge incident to T{v). Now any non-root 
vertex v will be an articulation point of the graph if high number of at least one of its children is equal to 
DFN{v). The root r of the DFS tree T will be an articulation point if it has more than one child. Any edge 
(x, y) (say x = par{y)) of the DFS tree will be a bridge if the high number of y is DFN{x) and the high 
number of each child of y (if any) is equal to DFN{y). Thus, given the high number of each vertex in the 
DFS tree, the articulation points and bridges can be determined in 0(n) time. 

We now show that given any set of k updates to graph G, we can not only construct the new tree T*, 
but also compute high point of each vertex in 0{nklog^ n) time. For each vertex x, let a{x) denote the 
highest ancestor of x in T* such that (x, a(x)) is an edge in G -h E. Note that if (x, a(x)) is a newly added 
edge, then it can be easily computed by scanning all the new edges added to the graph. This is due to fact 
that the total number of new edges added to G is bounded by nk. So we restrict ourselves to the case when 
(x, a(x)) was originally present in the graph G. Recall that our algorithm computed T* by attaching paths 
to the partially grown tree, where Ft and Fp are the set of paths attached to T* (during its construction) that 
originally belonged to T and F respectively. Also, path halving ensures that the size of Fp is bounded by 
k log n. For each path pQ ^ FtVJ Fp, let H (po) denote the vertex in po that is closest to r in T*. 

We now give an 0{nk log^ n) time algorithm for constructing a subset A{x) of neighbors of x such that 
the following condition holds. 

• For a vertex x, if o(x) ^ A{x), then there is some descendant y of x in T* such that a (x) G A{y). 

It is easy to see that if we get such an A{x) corresponding to each x, then high number of each vertex can 
be computed easily by processing the vertices of T* in bottom-up manner. Now depending upon whether 
paths containing x and a(x) belong to set Fp or Ft, we can have different cases. We show the construction 
of set A[x) along with the case it handles in following four steps. 

1. Vertex a(x) lies on a path in Fp 
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Figure 13: (i) Before the beginning of algorithm vertex x belongs to tree Tq € T, z is the highest ancestor 
of X in To such that (x, z) is an edge, (ii) The partitioning changes as the algorithm proceeds, Ti(g T) is the 
tree containing vertex z just before it is attached to T*. (iii) A path containing vertex z (i.e. pz) is extracted 
from Ti and attached to T*. If a{x) belongs to Tq, then it is the highest neighbor of x in pz- 


Consider a vertex x in T*. For each path po G Pp we query our data structure for an edge (rt, x) such 
that the end point u is closest to H{pq) on path pQ, and add u to A{x). Note that if a(x) lies on po> 
then u will be same as o(x). 

2. Vertex x lies on a path in Pp 

For each u £ T* and po G Pp, we query our data structure for an edge {u, y) such that the end point 
y is farthest from H{pq) on path po- We add u to A{y). Now consider a vertex x on po such that 
a(x) = u. If X is equal to y, then we have added a(x) (i.e. u) to A{x). If x is not equal to y, then we 
have added a(x) (i.e. u) to A{y) where y is descendant of x in T*. 

3. Vertex x and a(x) lies on same path in Pt 

Consider a vertex x lying on path po G Pt- We query our data structure for an edge {u, x) such that 
the end point u is closest to H{po) on path po, and add u to A{x). Note that if a(x) also lies on po, 
then u will be same as o(x). 

4. Vertex x and a(x) lies on different paths in Pt 

We first note that x and a(x) would belong to same tree (say Tq) in T, since disjoint tree partitioning 
ensures the absence of edges between two subtrees in T. Let z be the highest ancestor of x in Tq such 
that (x, z) is an edge mG + U. Let pz be the path in Pt containing vertex z. 

We now prove that a(x) belongs to pz- Recall that as the algorithm proceeds, our partitioning P UT 
evolves with time. Let Ti be the tree in T containing vertex z just before Pz is attached to T*. Then 
Ti is either same as Tq, or a subtree of Tq (see Figure [13] (i)). Also, a(x) must lie in tree Ti, since it 
cannot be an ancestor of z in Tq. Now let T 2 be the tree containing x which is obtained on removal 
of Pz from Ti- Since z is an ancestor of x in Tq, the vertices in T 2 will eventually hang from some 
descendant of z (not necessarily proper) in T*. For a(x) to be the highest neighbor of x in T*, it 
should be an ancestor of z in T* which is only possible if a(x) G Pz- 

Therefore, for each vertex x belonging to a tree Tq in T, we calculate the highest ancestor of z of 
X in To which has an edge incident from Tq. Now once T* is constructed let Pz{£ T)) be the path 
containing vertex z. We query our data structure for an edge {u, x) such that the end point u is closest 
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to H{pz) on path pz, and add u to A{x). Note that if a{x) also lies in Tq, then u must be same as a{x) 
(see Figure [T^ (hi)). 

Note that the total time taken by the first two steps is bounded by 0{nk log^ n) and by the last two steps 
is bounded by 0(n log^ n). Thus, we have the following theorem. 

Theorem 9.1 Given an undirected graph G{V,E) with |y| = n and \E\ = m, we can maintain a data 
structure for answering queries of biconnected components and 2 edge connectivity in a dynamic graph 
which takes 0{^fmnlog^'^ n) update time, 0(1) query time and 0(m log n) time for preprocessing. 

10 Lower Bounds 

We now prove two conditional lower bounds for maintaining DFS tree under vertex updates and edge up¬ 
dates. 

10.1 Vertex Updates 

The lower bound for maintaining DFS tree under vertex updates is based on Strong Exponential Time 
Hypothesis (SETH) as defined below: 

Definition 10.1 (SETH) For every e > 0, there exists a positive integer k, such that SAT on k—CNF for¬ 
mulas on n variables cannot be solved in 0(2^^“*^^"^) time. 

Given an undirected graph G on n vertices and m edges in a dynamic environment (incremental / decre- 
mental or fully dynamic) under vertex updates. The status of any vertex can be switched between active and 
inactive in an update. The goal of subgraph connectedness is to efficiently answer whether the subgraph 
induced by active vertices is connected. Abboud and WilliamslUl proved a conditional lower bound of D(n) 
per update based on SETH for answering dynamic subgraph connectedness queries. They proved that any 
algorithm for answering dynamic subgraph connectedness queries using arbitrary polynomial preprocessing 
time and amortized update time would essentially refute the SETH conjecture. They also proved 

that any algorithm for maintaining partially dynamic (incremental/decremental) subgraph connectedness us¬ 
ing arbitrary polynomial preprocessing time and 0(n^“^) worst case update time would essentially refute 
the SETH conjecture. 

We present a reduction from subgraph connectedness to maintaining DES tree under vertex updates 
requiring the algorithm to report whether the number of children of the root in any DES tree of the subgraph 
is greater than 1. Thus we establish the following: 

Theorem 10.1 Given an undirected graph G with n vertices and m edges undergoing vertex updates, an 
algorithm for maintaining DFS tree (that can report the number of children of the root in the DFS tree) 
with preprocessing time p(m, n), update time u{m, n) and query time q{m, n) would imply an algorithm 
for subgraph connectedness with preprocessing time p{m -|- n, n), update time u{m -\- n, n) and query time 
q{m + n, n). 

Proof: Given the graph G for which we need to query for subgraph connectedness, we make a graph G' 
as follows. We add all vertices and edges of G to G'. Eurther add another vertex r called as pseudo root 
and connect it to all other vertices of G'. Thus G' has n -\- 1 vertices and m -\- n edges. Now in any DES 
tree T of G' rooted on r, the number of children of r will be equal to the number of components in G. 
Here subtrees rooted on each child of s represents a component of G. Any change on G can be performed 
on G' and query for subgraph connectedness in G is equivalent to querying if r has more than 1 child in T. □ 
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Thus any algorithm for maintaining fully dynamic DFS under vertex updates with arbitrary preprocess¬ 
ing time and 0(n^“'^) amortized update time would refute SETH. Also any algorithm for maintaining par¬ 
tially dynamic DFS under vertex updates with arbitrary preprocessing time and worst case update 

time would refute SETH. 

10.2 Edge Updates 

We now present a lower bound for maintaining a DFS tree under edge updates that applies on any algorithm 
which maintains tree edges of the DFS tree explicitly. In the following example we prove that there exists a 
graph G and a sequence of edge updates U, such that any DFS tree of the graph would require a conversion 
of D(n) edges from tree edges to back edges and vice-versa after every pair of updates in U. 





Figure 14: Worst Case Example for lower bound on maintaining DFS tree under fully dynamic edge updates. 

Consider the following graph for which a DFS tree rooted at r is to be maintained under fully dynamic 
edge updates. There are n/2 vertices that have edges to vertices x and y. The remaining n/2 — 3 

vertices ui,..., are connected in form of a line as shown in Figure [T4| At any point of time one of ui,..., 
(say vi) is connected to either x or y. The DFS tree for the graph is shown in Figure [^(i). Now, upon 
insertion of edge {vi, x) (say i = 2) and deletion of edge {vi,y) the DFS tree will transform to either Figure 
[n](ii) or Figure[^(iii). Clearly D(n) edges are converted from tree edge to back edge and vice-versa. This 
can be repeated for any Vi alternating between x and y ensuring that the new DFS tree is not exactly the 
same as some previous DFS tree (thus memorization of the complete tree will not help). Thus any algorithm 
maintaining tree edges explicitly takes D(n) time to handle such a pair of edge updates. 


11 Conclusion 


We have presented a fully dynamic algorithm for maintaining a DFS tree that takes worst case 0{^Jmn) 
update time. This is the first fully dynamic algorithm that achieves o{m) update time. In the fault tolerant 
setting our algorithm takes 0{nk) time to report a DFS tree, where k is the number of vertex or edge failures 
in the graph. We show the immediate applications of fully dynamic DFS for solving various problems such 
as dynamic subgraph connectivity, bi-connectivity and 2 edge connectivity. We also prove the conditional 


lower bound of D(n) on maintaining DFS tree under vertex/edge updates. Refer to 10 for details. 

DFS tree has been extensively used for solving various graph problems in the static setting. Most 
of these problems are also solved efficiently in the dynamic environment. However, their solutions have 
not used dynamic DFS tree. This paper is an attempt to restore the glory of DFS trees for solving graph 
problems in the dynamic setting as was the case in the static setting. We believe that our dynamic algorithm 
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for DFS, on its own or after further improvements/modifieations, would eneourage other researehers to use 
it in solving various other dynamic graph problems. 
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